Pull-based load distribution among heterogeneous parallel servers: the case of multiple routers

Author

  • Alexander L. Stolyar
Abstract

The model is a service system consisting of several large server pools. A server's processing speed and buffer size (which may be finite or infinite) depend on the pool. The input flow of customers is split equally among a fixed number of routers, which must assign customers to servers immediately upon arrival. We consider an asymptotic regime in which the total customer arrival rate and the pool sizes scale to infinity simultaneously, in proportion to a scaling parameter n, while the number of routers remains fixed. We define and study a multi-router generalization of the pull-based customer assignment (routing) algorithm PULL, introduced in [10] for the single-router model. Under the PULL algorithm, when a server becomes idle it sends a “pull-message” to a router selected uniformly at random. Each router operates independently: it assigns an arriving customer to a server according to a pull-message chosen uniformly at random among those available at this router, if there are any, and otherwise to a server selected uniformly at random from the entire system. Under Markov assumptions (Poisson arrival process and independent exponentially distributed service requirements) and under sub-critical system load, we prove asymptotic optimality of PULL: as n → ∞, the steady-state probability that an arriving customer experiences blocking or waiting vanishes. Furthermore, PULL has an extremely low router-server message exchange rate of one message per customer. These results generalize some of the single-router results in [10].
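The routing rule described in the abstract can be illustrated with a small simulation. What follows is only a minimal sketch, not the paper's model or proof technique: it assumes a single homogeneous pool with unit service rate and infinite buffers, treats the equal split of the input flow as sending each arrival to a uniformly random router, and assumes that a pull-message is withdrawn when its server receives a customer through the random fallback. The function name `simulate_pull` and all of its parameters are hypothetical.

```python
import random


def simulate_pull(num_servers=200, num_routers=3, load=0.7,
                  num_arrivals=20000, seed=0):
    """Simulate PULL; return the fraction of arrivals sent to a busy server."""
    rng = random.Random(seed)
    mu = 1.0                                  # assumed common service rate
    lam = load * num_servers * mu             # total Poisson arrival rate
    queue = [0] * num_servers                 # customers present at each server
    holder = [None] * num_servers             # router holding each server's pull-message
    pulls = [[] for _ in range(num_routers)]  # pull-messages available at each router
    busy = []                                 # ids of servers with queue > 0

    def send_pull(s):
        # An idle server sends its pull-message to a uniformly random router.
        r = rng.randrange(num_routers)
        holder[s] = r
        pulls[r].append(s)

    for s in range(num_servers):              # all servers start idle
        send_pull(s)

    waited = arrivals = 0
    while arrivals < num_arrivals:
        total_rate = lam + mu * len(busy)
        if rng.random() < lam / total_rate:   # next event is an arrival
            arrivals += 1
            r = rng.randrange(num_routers)    # equal split of the input flow
            if pulls[r]:
                # Use a uniformly random pull-message available at this router.
                i = rng.randrange(len(pulls[r]))
                pulls[r][i], pulls[r][-1] = pulls[r][-1], pulls[r][i]
                s = pulls[r].pop()
            else:
                # No pull-message here: uniformly random server in the system.
                s = rng.randrange(num_servers)
                if holder[s] is not None:     # assumed: stale message is withdrawn
                    pulls[holder[s]].remove(s)
            holder[s] = None
            if queue[s] > 0:
                waited += 1
            else:
                busy.append(s)
            queue[s] += 1
        else:                                 # next event is a service completion
            i = rng.randrange(len(busy))
            s = busy[i]
            queue[s] -= 1
            if queue[s] == 0:
                busy[i] = busy[-1]
                busy.pop()
                send_pull(s)
    return waited / arrivals
```

Calling `simulate_pull()` returns the observed fraction of customers routed to a busy server. Raising `num_servers` at a fixed `load` below 1 is one way to explore, informally, the regime in which the abstract states that this fraction vanishes.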


Similar resources

Performance Analysis of a Pull-Based Parallel Video Server

In conventional video-on-demand systems, video data are stored in a video server for delivery to multiple receivers over a communications network. The video server's hardware limits the maximum storage capacity as well as the maximum number of video sessions that can concurrently be delivered. Clearly, these limits will eventually be exceeded by the growing need for better video quality and la...


Vacation model for Markov machine repair problem with two heterogeneous unreliable servers and threshold recovery

A Markov model of a multi-component machining system comprising two unreliable heterogeneous servers and a mixed type of standby support is studied. Repair of broken-down machines follows a bi-level threshold policy for activating the servers. A server returns to repair work when the pre-specified workload of failed machines has built up. The first (seco...


Power-efficient Load Distribution in Heterogeneous Computing Environments

High performance servers of heterogeneous computing environments, as can be found in data centers for cloud computing, consume immense amounts of energy even though they are usually underutilized. In times when not all computing capabilities are needed the task to be solved is how to distribute the computational load in a power-efficient manner. The question to be answered is, what load partiti...


Adaptive Load Sharing in Video-On-Demand Distributed Multimedia System Using Parallel Servers

In a traditional video-on-demand system, video data is stored in a video server for delivery to multiple receivers over a communications network. The hardware in this video server has the following limitations: 1) the maximum storage capacity, and 2) the maximum number of video sessions that can be delivered simultaneously. We propose a novel Adaptive-Multiple-Video-Server-Architecture (AM...


Dynamic Load Balancing in Geographically Distributed Heterogeneous Web Servers

With ever increasing Web traffic, a distributed multi-server Web site can provide scalability and flexibility to cope with growing client demands. Load balancing algorithms to spread the requests across multiple Web servers are crucial to achieve the scalability. Various domain name server (DNS) based schedulers have been proposed in the literature, mainly for multiple homogeneous servers. The pre...


Journal title:
  • Queueing Syst.

Volume 85  Issue

Pages  -

Publication date 2017